[Infer] Change groupwise weight quant from cpu to gpu for deepseek_v2 model #10174

zeroRains · 2025-03-18T07:05:39Z

Before submitting

Lint code. If there are lint issues, please format the code first.

# Install and register `pre-commit` in the project folder
pip install pre-commit && pre-commit install

# Process previous code files separately
pre-commit run --file XXXX.py

Add test cases into tests folder. If there are codecov issues, please add tests cases first.

PR types

Others

PR changes

Others

Description

移除deepseek-v2使用group-wise weight quant时使用的CPU kernel，修改为使用GPU Kernel并添加单测。

paddle-bot · 2025-03-18T07:05:44Z

Thanks for your contribution!

codecov · 2025-03-18T07:40:46Z

Codecov Report

Attention: Patch coverage is 0% with 32 lines in your changes missing coverage. Please review.

Project coverage is 46.97%. Comparing base (759ae99) to head (97ed730).
Report is 103 commits behind head on develop.

Files with missing lines	Patch %	Lines
.../experimental/transformers/deepseek_v2/modeling.py	0.00%	17 Missing ⚠️
...erimental/transformers/fused_transformer_layers.py	0.00%	8 Missing ⚠️
...dlenlp/experimental/transformers/qwen2/modeling.py	0.00%	7 Missing ⚠️

❌ Your patch check has failed because the patch coverage (0.00%) is below the target coverage (80.00%). You can increase the patch coverage or adjust the target coverage.
❌ Your project check has failed because the head coverage (46.97%) is below the target coverage (58.00%). You can increase the head coverage or adjust the target coverage.

Additional details and impacted files

@@             Coverage Diff             @@
##           develop   #10174      +/-   ##
===========================================
- Coverage    48.66%   46.97%   -1.70%     
===========================================
  Files          768      799      +31     
  Lines       127103   132266    +5163     
===========================================
+ Hits         61860    62137     +277     
- Misses       65243    70129    +4886

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

…nto wq

…u kernel

…nto wq

github-actions · 2025-07-17T00:26:42Z

This Pull Request is stale because it has been open for 60 days with no activity. 当前Pull Request 60天内无活动，被标记为stale。

change groupwise weight quant from cpu to gpu for deepseek_v2 model

a63fa39

paddle-bot bot added the contributor label Mar 18, 2025

paddle-bot bot assigned wawltor Mar 18, 2025

zeroRains mentioned this pull request Mar 18, 2025

[Inference] Support group-wize quantize for weight_quantize op in GPU PaddlePaddle/Paddle#71549

Open

zeroRains added 4 commits March 27, 2025 06:01

fix conflict

e789537

Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleNLP i…

e16f4e2

…nto wq

support group-wise weight quant for qwen2 and change cpu kernel to gp…

8d0a39a

…u kernel

support moe group_wise weight quant

1aae565

zeroRains force-pushed the wq branch from bbd42d3 to 1aae565 Compare April 20, 2025 09:16

zeroRains added 4 commits April 20, 2025 09:19

fix

09ab2f3

Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleNLP i…

821e4d6

…nto wq

add the iterscale for fused_moe

ed47c7e

fix the conflict

bdf7fe0

yuanlehome self-requested a review May 13, 2025 03:20

fix the search call

97ed730

github-actions bot added the stale label Jul 17, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Infer] Change groupwise weight quant from cpu to gpu for deepseek_v2 model #10174

[Infer] Change groupwise weight quant from cpu to gpu for deepseek_v2 model #10174

Uh oh!

zeroRains commented Mar 18, 2025

Uh oh!

paddle-bot bot commented Mar 18, 2025

Uh oh!

codecov bot commented Mar 18, 2025 •

edited

Loading

Uh oh!

github-actions bot commented Jul 17, 2025

Uh oh!

Uh oh!

[Infer] Change groupwise weight quant from cpu to gpu for deepseek_v2 model #10174

Are you sure you want to change the base?

[Infer] Change groupwise weight quant from cpu to gpu for deepseek_v2 model #10174

Uh oh!

Conversation

zeroRains commented Mar 18, 2025

Before submitting

PR types

PR changes

Description

Uh oh!

paddle-bot bot commented Mar 18, 2025

Uh oh!

codecov bot commented Mar 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

github-actions bot commented Jul 17, 2025

Uh oh!

Uh oh!

codecov bot commented Mar 18, 2025 •

edited

Loading